A Baseline Speech Recognition System for Levantine Colloquial Arabic
Abstract
The Arabic language has many colloquial varieties that differ significantly from the standard form. In this paper, we propose a state-of-the-art speech recognition system for Levantine Colloquial Arabic (LCA). A fully continuous, context-dependent acoustic model was trained on 50 hours of speech from the BBN DARPA Babylon corpus. Pronunciation modeling was initially grapheme-based because the transcriptions lack diacritic marks. Acoustic model parameters, including the number of senones and Gaussians, were optimized. To improve recognition accuracy, we propose a cross-lingual hybrid acoustic and pronunciation modeling approach in which a phoneme-based Modern Standard Arabic (MSA) acoustic model is adapted using a small amount of LCA speech data. The adapted acoustic model was then combined with the initial grapheme-based model to create a hybrid acoustic model.
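As a rough illustration of the grapheme-based pronunciation modeling mentioned in the abstract, the sketch below builds a lexicon in which each word's pronunciation is simply its sequence of written letters, so no diacritics are needed. The function name `grapheme_lexicon` and the ASCII-transliterated sample words are assumptions made for illustration only; the abstract does not describe the toolkit or the lexicon format the authors used.

```python
def grapheme_lexicon(words):
    """Map each undiacritized word to a single pronunciation: its letter sequence.

    Hypothetical helper for illustration; not the authors' implementation.
    """
    lexicon = {}
    for word in words:
        # Each written character is treated as one acoustic unit.
        # Short vowels, which the transcriptions omit, are not modeled explicitly.
        lexicon[word] = list(word)
    return lexicon


# Sample words in an ASCII transliteration (an assumption for readability).
sample_words = ["ktAb", "byt"]
for word, units in grapheme_lexicon(sample_words).items():
    print(word, " ".join(units))
```

A phoneme-based lexicon, by contrast, would need the missing short vowels to be restored before pronunciations could be generated, which is why the grapheme-based form is a natural starting point for undiacritized transcriptions.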
Similar resources
CRF-based Diacritisation of Colloquial Arabic for Automatic Speech Recognition
Most of the available colloquial Arabic speech resources are transcribed without diacritics. These diacritics convey short vowels and other pronunciation information, and omitting them introduces a considerable amount of ambiguity. In this paper, we propose the use of an automatic diacritisation method as a front-end for training automatic speech recognition systems of colloquial Arab...
An Investigation in Speech Recognition for Colloquial Arabic
This paper describes a study of grapheme-based speech recognition for colloquial Arabic. An investigation of language and acoustic model configurations is carried out to illustrate the differences between colloquial and modern standard Arabic (MSA) on the example of Levantine telephone conversations. The study defines extensive and carefully crafted data sets for different dialects and studies ...
Design and evaluation of a limited two-way speech translator
We present a limited speech translation system for English and colloquial Levantine Arabic, which we are currently developing as part of the DARPA Babylon program. The system is intended for question/answer communication between an English-speaking operator and an Arabic-speaking subject. It uses speech recognition to convert a spoken English question into text, and plays out a pre-recorded spe...
Colloquialising Modern Standard Arabic Text for Improved Speech Recognition
Modern standard Arabic (MSA) is the official language of spoken and written Arabic media. Colloquial Arabic (CA) is the set of spoken variants of modern Arabic that exist in the form of regional dialects. CA is used in informal, everyday conversations, while MSA is used in formal communication. An Arabic speaker switches between the two variants according to the situation. Developing an automatic spe...
Development of a conversational telephone speech recognizer for Levantine Arabic
Many languages, including Arabic, are characterized by a wide variety of different dialects that often differ strongly from each other. When developing speech technology for dialect-rich languages, the portability and reusability of data, algorithms, and system components become extremely important. In this paper, we describe the development of a large-vocabulary speech recognition system for ...